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Abstract. The paper presents the navigation mobile walking 
robot systems for movement in non-stationary and non- 
structured environments, using a Bayesian approach of 
Simultaneous Localization and Mapping (SLAM) for avoiding 
obstacles and dynamical stability control for motion on rough 
terrain. By processing inertial information of force, torque, 
tilting and wireless sensor networks (WSN) an intelligent high 
level algorithm is implementing using the virtual projection 
method. The control system architecture for the dynamic robot 
walking is presented in correlation with a stochastic model of 
assessing system probability of unidirectional or bidirectional 
transition states, applying the non-homogeneous/non-stationary 
Markov chains. The rationality and validity of the proposed 
model are demonstrated via an example of quantitative 
assessment of states probabilities of an autonomous robot. The 
results show that the proposed new navigation strategy of the 
mobile robot using Bayesian approach walking robot control 
systems for going around obstacles has increased the robot’s 
mobility and stability in workspace. 


I. INTRODUCTION 


W alking robots, unlike other types of robots such as those 

with wheels or tracks, use similar devices for moving 
on the field like human or animal feet. A desirable 
characteristic a mobile robot must have the skills needed to 
recognize the landmarks and objects that surround it, and to 
be able to localize itself relative to its workspace. This 
knowledge is crucial for the successful completion of 
intelligent navigation tasks. But, for such interaction to take 
place, a model or description of the environment needs to be 
specified beforehand. If a global description or measurement 
of the elements present in the environment is available, the 
problem consists on the interpretation and matching of 
sensor readings to such previously stored object models. 
Moreover, if we know that the recognized objects are fixed 
and persist in the scene, they can be regarded as landmarks, 
and can be used as reference points for self localization. If 
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on the other hand, a global description or measurement of 
the elements in the environment is not available, at least the 
descriptors and methods that will be used for the 
autonomous building of one are required [1]. 

The approach of the localization and navigation 
problems of a mobile robot which uses a WSN which 
comprises of a large number of distributed nodes with low- 
cost cameras as main sensor, have the main advantage of 
require no collaboration from the object being tracked. 

The main advantages of using WSN _ multi-camera 
localization and tracking are: 

1) the exploit of the distributed sensing capabilities of the 
WSN; 

2) the benefit from the parallel computing capabilities of 
the distributed nodes. Even though each node have finite 
battery lifetime by cooperating with each other, they can 
perform tasks that are difficult to handle by traditional 
centralized sensing system.; 

3) the employ of the communication infrastructure of the 
WSN to overcome multi-camera network issues. Also, 
camera-based WSN have easier deployment and higher re- 
configurability than traditional camera networks making 
them particularly interesting in applications such as security 
and search and rescue, where pre-existing infrastructure 
might be damaged [2]. 

Robots have to know where in the map they are in order 
to perform any task involving navigation. Probabilistic 
algorithms have proved very successful in many robotic 
environments. They calculate the probability of each 
possible position given some sensor readings and movement 
data provided by the robot [5]. The localization of a mobile 
robot is made using a particle filter that updates the belief of 
localization which, and estimates the maximal posterior 
probability density for localization. The causal and 
contextual relations of the sensing results and global 
localization in a Bayesian network, and a sensor planning 
approach based on Bayesian network inference to solve the 
dynamic environment is presented. In the study is proposed 
a mobile robot sensor planning approach based on a top- 
down decision tree algorithm. Since the system has to 
compute the utility values of all possible sensor selections in 
every planning step, the planning process is very complex. 

The paper first presents the position force control and 
dynamic control using ZMP and inertial information with 
the aim of improving robot stability for movement in non- 
structured environments. The next chapter presents the 
mobile walking robot control system architecture for 
movement in non-stationary environments by applying 


Wireless Sensor Networks (WSN) methods. Finally, there 
are presented the results obtained in implementing the 
interface for sensor networks used to avoid obstacles and in 
improving the performance of dynamic stability control for 
motion on rough terrain, through a Bayesian approach of 
Simultaneous Localization and Mapping (SLAM). 


Il. DYNAMICAL STABILITY CONTROL 


The research evidences that stable gaits can be achieved by 
employing simple control approaches which take advantage 
of the dynamics of compliant systems. This allows a 
decentralization of the control system, through which a 
central command establishes the general movement 
trajectory and local control laws presented in the paper solve 
the motion stability problems, such as: damping control, 
ZMP compensation control, landing orientation control, gait 
timing control, walking pattern control, predictable motion 
control (see ICAMechS 2011, Zhengzhou [3]). 

In order to carry out new capabilities for walking robots, 
such as walking down the slope, going by overcoming or 
avoiding obstacles, it 1s necessary to develop high-level 
intelligent algorithms, because the mechanism of walking 
robots stepping on a road with bumps is a complicated 
process to understand, being a repetitive process of tilting or 
unstable movements that can lead to the overthrow of the 
robot. The chosen method that adapts well to walking robots 
is the ZMP (Zero Moment Point) method. A new strategy is 
developed for the dynamic control for walking robot 
stepping using ZMP and inertial information. This, includes 
pattern generation of compliant walking, real-time ZMP 
compensation in one phase - support phase, the leg joint 
damping control, stable stepping control and stepping 
position control based on angular velocity of the platform. In 
this way, the walking robot is able to adapt on uneven 
ground, through real time control, without losing its stability 
during walking [13]. 

Based on studies and analysis, the compliant control 
system architecture was completed with tracking functions 
for HFPC walking robots, which through the implementation 
of many control loops in different phase of the walking 
robot, led to the development of new technological 
capabilities, to adapt the robot walking on sloping land, with 
obstacles and bumps. In this sense, a new control algorithm 
has been studied and analyzed for dynamic walking of 
robots based on sensory tools such as force / torque and 
inertial sensors [3,13]. Distributed control system 
architecture was integrated into the HFPC architecture so 
that it can be controlled with high efficiency and high 
performance. 


IHI. SIMULTANEOUS LOCALIZATION AND MAPING 


A precise position error compensation and low-cost 
relative localization method is studied in [5] for structured 
environments using magnetic landmarks and hall sensors 
[6]. The proposed methodology can solve the problem of 
fine localization as well as global localization by tacking 
landmarks or by utilizing various patterns of magnetic 
landmark arrangement. The research in localization and 


tracking methods using Wireless Sensor Networks (WSN 
have been developed based on Radio Signal Strength 
Intensity (RSSI) [7] and ultrasound time of flight (TOF) [8]. 
Localization based on Radio Frequency Identification 
(RFID) systems have been used in fields such as logistics 
and transportation [9] but the constraints in terms of range 
between transmitter and reader limits its potential 
applications. Many efforts have been devoted to the 
development of cooperative perception strategies exploiting 
the complementarities among distributed static cameras at 
ground locations [10], among cameras mounted on mobile 
robotic platforms [11], and among static cameras and 
cameras onboard mobile robots [12]. Computation-based 
closed-loop controllers put most of the decision burden on 
the planning task. In hazardous and populated environments 
mobile robots utilize motion planning which relies on 
accurate, static models of the environments, and therefore 
they often fail their mission if humans or other unpredictable 
obstacles block their path. Autonomous mobile robots 
systems that can perceive their environments, react to 
unforeseen circumstances, and plan dynamically in order to 
achieve their mission have the objective of the motion 
planning and control problem [4, 9]. 
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Figure 1 Mobile robot control system architecture 


To find collision-free trajectories, in static or dynamic 
environments containing some obstacles, between a start and 
a goal configuration, the navigation of a mobile robot 
comprises localization, motion control, motion planning and 
collision avoidance. Its task is also the online real-time re- 
planning of trajectories in the case of obstacles blocking the 
pre-planned path or another unexpected event occurring. 
Inherent in any navigation scheme is the desire to reach a 


destination without getting lost or crashing into anything. 
The responsibility for making this decision is shared by the 
process that creates the knowledge representation and the 
process that constructs a plan of action based on this 
knowledge representation. The choice of which 
representation is used and what knowledge is stored helps to 
decide the division of this responsibility. Very complex 
reasoning may be required to condense all of the available 
information into this single measure [4, 14]. The techniques 
include computation-based closed-loop control, cost-based 
search strategies, finite state machines, and rule-based 
systems [17]. 

Computation-based closed-loop controllers put most 
of the decision burden on the planning task. In hazardous 
and populated environments mobile robots utilize motion 
planning which relies on accurate, static models of the 
environments, and therefore they often fail their mission if 
humans or other unpredictable obstacles block their path. 
Autonomous mobile robots systems that can perceive their 
environments, react to unforeseen circumstances, and plan 
dynamically in order to achieve their mission have the 
objective of the motion planning and control problem. To 
find collision-free trajectories, in static or dynamic 
environments containing some obstacles, between a start and 
a goal configuration, the navigation of a mobile robot 
comprises localization, motion control, motion planning and 
collision avoidance [15, 16]. A higher-level process, a task 
planner, specifies the destination and any constraints on the 
course, such as time. Most mobile robot algorithms abort, 
when they encounter situations that make the navigation 
difficult. Set simply, the navigation problem is to find a path 
from start (S) to goal (G) and traverse it without collision. 
The relationship between the subtasks mapping and 
modeling of the environment; path planning and selection; 
path traversal and collision avoidance into which the 
navigation problem is decomposed, is shown in Figure 1. 

Motion planning of mobile walking robots in uncertain 
dynamic environments based on the behavior dynamics of 
collision-avoidance is transformed into an optimization 
problem. Applying constraints based on control of the 
behavior dynamics, the decision-making space of this 
optimization. 


IV. VIRTUAL PROJECTION METHOD 


A virtual projection architecture system was designed which 
allows improvement and verification of the performance of 
dynamic force-position control of walking robots by 
integrating the multi-stage fuzzy method with acceleration 
solved in position-force control and dynamic control loops 
through the ZMP method for movement in non-structured 
environments and a bayesian approach of simultaneous 
localization and mapping (SLAM) for avoiding obstacles in 
non-stationary environments. By processing inertial 
information of force, torque, tilting and wireless sensor 
networks (WSN) an intelligent high level algorithm is 
implementing using the virtual projection method. 

The virtual projection method, presented in Figure 2, 
patented by the research team, tests the performance of 


dynamic position-force control by integrating dynamic 
control loops and a bayesian interface for the sensor 
network. The CMC classical mechatronic control directly 
actions the MS1, MSm servomotors, where m is the 
numbber of the robot’s degrees of freedom. These signals 
are sent to a virtual control interface (VCI), which processes 
them and genrates the necessary signals for graphical 
representation in 3D on a graphical terminal CGD. A 
number of n control interface functions ICF1-ICFn ensure 
the development of an open architecture control system by 
intergrating n control functions in addition to those supplied 
by the CMC mechatronic control system. With the help of 
these, new control methods can be implemented, such as: 
contour tracking functions, control schemes for tripod 
walking, centre of gravity control, orientation control 
through image processing and Bayesian interface for sensor 
networks. Priority control real time control and information 
exchange management between the n interfaces is ensured 
by the multifunctional control interface MCI, interconnected 
through a high speed data bus. 





Fig. 2. The virtual projection method 


Bayesian Interface for sensor networks. 


To determine the priors for the model parameters and to 
calculate likelihood function (joint probability) we define a 
given random variable x whose probability distribution 
depends on a set of parameters P = (P), P2, ... P,). Exact 
values of the parameters are not known with certainty, 
Bayesian reasoning assigns a probability distribution of the 
various possible values of these parameters that are 
considered as random variables. Bayes' theory is generally 
expressed through probabilistic statements as following: 
P(B | A) (1) 

P(B) 
P (A | B) is the probability of A given the event B occurs or 
the posteriori probability. Using Bayes' theory may be 
recurring, that if exist an a priori distribution (P (A) and a 
series of tests with experimental results B1, Bo,...,Bp..., 
expressed according to successive equations: 
P(B, | A) 

P(B,) 
P(B, | A) P(B, | A) 

P(B) P(B,) 


P(A| B) = P(A) x 


P(A | B,) = P(A) 
(2) 


P(A| B,,B,) = P(A) 


P(B, | A) 
P(B,) 

A posteriori distribution called also belief, is used when 
the test results are known, being obtained as a new function 
a priori. The start of operations sequences in the Bayesian 
method regards the transformation y. Recursive Bayesian 
updating is made under the Markov assumption: z, is 
independent of Z),...,Zn-; 1f we know x. 
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P(Zn| Z1,...5 22-1) 
=n P(zn| x) P(x | z1,...,2Z0-1) (3) 
=m, I] P(zi| x) P(x) 


i=l...n 
When there are no missing data or hidden variables the 
method for calculating P(B, D) for some belief-network 
structure Bs; and database D is presented in [12].Let Q be the 
set of all those belief-network structures that have a non-zero 
prior probability. We can derive the posterior probability of 
Bg given D as: 
P(B, | D) = P(B,,,D)/>B,, € QP(B,,,D) (4) 
The ratio of the posterior probabilities of two belief-network 
structures can be calculated as a ratio for belief-network 
structures B, and B,;, using the equivalence: 
P(B; | D)/ P(B; | D) = P(B,,,D)/ P(B,,D) (5) 
which we can derive that: 
P(B,,,D) = P(D| B,,)P(B,,) (6) 
Term P(B,;) represents prior probability that a process with 
belief-network structure Bs To designate the possible 
values of h, ca be used the Markov blanket method, MB(h) 
[12, 13]. Suppose that among the m cases in D there are u 
unique instantiations of the variables in MB(h). Given these 
conditions it follows that: 


P(D|Bs)=3.-Ef (Gr G) | UL PCH, | Bs Bp IF (Bs | B,)dB, 7) 


where G; is a given group contains c; case-specific hidden 
variables. Recall that u denotes only the number of unique 
instantiations actually realized in database D of the variables 
in the Markov blanket of hidden variable h. The number of 
such unique instantiations significantly influences the 
efficiency with which we can compute Equation 7. 
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Fig.3 The model with three states for the robotic system 





For any finite belief network, the number of such unique 
instantiations reaches a maximum regardless of how many 
cases there are in the database. That r denotes the maximum 
number of possible values for any variable in the database. If 
u and r are bounded from above, then the time to solve 
Equation 7 is bounded from above by a function that is 
polynomial in the number of variables n and the number of 
cases m. If u or ris large, however, the polynomial will be of 
high degree [12]. 


To model a robotic system requires considering in- 
between the two states of operating and faulting one or more 
intermediate states of partial success. In figure 3 is 
considered a robotic system characterized by three states: 
operating at full capacity (F), defect (D) and intermediate (I). 

A generalized diagram of states is shown in figure 4, 
which included three intermediate states. 
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Fig. 4. Generalized diagram of states with three 
intermediate states 


The Markov modeling technique requires to identify 
each intermediate state (in practice, more neighboring levels 
can be grouped together), to know the occupancy status of 
each component (Ti) and the number of transitions between 
states (Nij), which can calculate as follows: 





a . T, 
- occupancy probability of “1’state: P = = 
A 
KEDE . i N; 
- transition intensity from state "i" in "J": A; = a 


where: T, =} T, is analyzed time interval. 


The number of intermediate states to be modeled in order 
to obtain a more accurate assessment of the reliability group 
is necessary to consider more than one intermediate state. 
Figure 5 presents a model with six states to assess the 
predictable transitions in a robotic system. The six states of 
the system are: 

1 - operational state of robot; 

2 - landing control 

3 - balance control 

4 - advance control 

5 - wireless sensor networks (WSN) control 
6 - unpredict event 
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Fig.5. Modeling the states with possible transitions for robot 


Based on the surveillance data in operation regime of robot 
were determined transition probabilities using of the 


h: 
relationship: p, =— , where n, is the transition from state 
n 


i 


moen 


in "7" in the analysis time interval; n;is the number of all 
transitions from state "7" in any other states. 

Values of these transition probabilities are: p, = 0,247; 

Pa =9,32; p =0,125; pÐ, =0,205; Ð, = 0,103; By 
applying the method Markov chains are obtain the 
occupancy probability of the sates for the robot: P;=0,31; 
P,=0,208; P3=0,115; Py=0,205; P5=0,102; P.=0,06. 
The working diagram of the Petri network is presented in 
figure 6 (http://www-dssz.informatik.tu-cottbus.de). A token 
is assigned to P3, and is assumed that the localizer initially 
knows its position. The Warning event ts; fires when the 
localizer fails in estimating robot’s accurate position for 
several steps. Two navigation primitives can be modeled as 
P,, P2, respectively. Initially, the robot selects its motion by a 
random switch comprising the transitions t; and t) with 
corresponds to probabilities P? and P’,, respectively. The 
transition between them takes place according to the change 
of localizer states. The immediate transition t; means that the 
robot takes Contour tracking as soon as the localizer 
Warning event fires. 
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Fig.6. The Petri network diagram 


The other transition between two primitives, t) and t4, are 
modeled as timed transitions in order to express that the 
robot can change its current navigation primitive during the 
localizer Success state, if necessary. 


V. RESULTS AND CONCLUSION 


The control for walking robots is achieved by a control 
system with three levels. The first level is to produce control 
signals for motor drive mounted on leg joints, ensuring the 
robot moving in the direction required with a given speed. 
The language for this level is that of differential equations. 
The second level controls the walking, respectively it 
coordinates the movements, provides the data necessary to 
achieve progress. At this level, work is described in the 
language of algorithms types of walking. The third level of 
command defines the type of walking, speed and orientation. 
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Fig.7. Open architecture system of the walking robot 


At this level, the command may be provided by an 
operator who can use the control panel, in pursuit of its link 
with the robot, to specify the type of running and passing 
special orders (for the definition of the vector speed of 
movement). 

To maintain the platform in a horizontal position, the 
information provided by the horizontality transducers (or 
verticality) is used, that sense walking robots deviation 
platform to the horizontal position. Restoring the horizontal 
position of the platform is achieved at the expense of vertical 
movement of different legs of support, as decided by the 
block to maintain balance. Returning to the fixed height of 
the platform is achieved by using information provided by 
the height transducer of the platform and by simultaneous 
control of vertical movement of all legs in support phase. 
From the analysis performed results the effectiveness of the 
proposed control strategy for a walking robot. The position 
of each actuator is controlled by a PD feedback loop, using 
encoder like transducers. 

In HFPC control system, the PC system sends the 
reference positions to all actuators controllers 
simultaneously at an interval of 10 ms (100 Hz). Reference 
positions for the control of 18 actuators and actual positions 
on each axis robot obtained through interpolation are 
processed at an interval of 1 ms (1 kHz). Figure 1 shows the 
general configuration of the HFPC system for ZMP control 
method. The control system is distributive with multi- 
processor devices for joint control, data reception from 
transducers mounted on the robot, peripheral devices 
connected through a wireless LAN for off-line 
communications and CAN fast communication network for 
real time control. The HFPC system was designed in a 
distributed and decentralized structure to enable 
development of new applications easily and to add new 
modules for new hardware or software control functions. 
Moreover, the short time execution will ensure a faster 
feedback, allowing other programs to be performed in real 


time as well, like the apprehension force control, objects 
recognition, making it possible that the control system have 
a human flexible and friendly interface. 
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